Scaling entropy and automorphisms 
with purely point spectrum* 



A. M. Vershik 



To the memory of my friend Misha Birman 
Abstract 

We study the dynamics of metrics generated by measure-preserving 
transformations. We consider sequences of average metrics and e- 
entropies of the measure with respect to these metrics. The main 
result, which gives a criterion for checking that the spectrum of a 
transformation if purely point, is that the scaling sequence for the 
e-entropies with respect to the averages of an admissible metric is 
bounded if and only if the automorphism has a purely point spectrum. 
This paper is one of a series of papers by the author devoted to the 
asymptotic theory of sequences of metric measure spaces and its ap- 
plications to ergodic theory. 
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1 Introduction 

Among the many mathematical and nonmathematical problems we have been 
discussing with Misha Birman for many years after our acquaintance began in 
the early 1960s, the most intriguing one was the parallel between scattering 
theory and ergodic theory. Recently, I have returned to the (yet nonexis- 
tent) "ergodic scattering theory" and some forgotten questions related to it. 
However, this paper deals with another subject, which also correlates with 
M. Sh. Bir man's research. 

It is well known that the problem of deciding whether or not the spectrum 
of a given (say, differential) operator is discrete is quite difficult. Several 
remarkable early papers by M. Sh. Birman (in the first place, [1]) dealt with 
exactly this problem. In ergodic theory and the theory of dynamical systems, 
this problem (whether or not the system of eigenf unctions is complete) is also 
very difficult. In what follows, we use the terminology common in ergodic 
theory that slightly differs from the one adopted in operator theory: we use 
the term "discrete spectrum" as a synonym for "pure point spectrum." This 
is justified by the fact that a discrete spectrum in the sense of operator theory 
almost never appears in the theory of dynamical systems, since almost always 
one deals with unitary operators. 

As an example demonstrating the difficulty of this problem, we can men- 
tion the theory of substitutions, or stationary adic transformations [2J, on 
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which an extensive literature exists. The most intriguing problems concern 
nonstationary adic transformations with subexponential growth of the num- 
ber of vertices in the corresponding Bratteli-Vershik diagram (see [2]). The 
simplest and most popular example of such a transformation is the Pascal 
automorphism defined in [3]; in this case, the measure space (i. e., the phase 
space of the dynamical system) is the space of infinite paths in the Pascal 
graph endowed with a Bernoulli measure, and the transformation sends a 
path to its successor in the natural lexicographic order. In spite of many 
efforts, we still do not know whether the corresponding unitary operator has 
a discrete (i.e., pure point) or mixed spectrum, or, which seems most likely, 
its spectrum is purely continuous. Attempts to directly construct its eigen- 
functions failed; another approach, based on the characterization of systems 
with discrete spectrum in terms of Kushnirenko's sequential entropy, [10] has 
not been carried out. 

In the paper [TJ] (for more details, see (H [HI [21]), we suggested a new 
notion, the so-called scaling entropy, which generalizes the notion of Kol- 
mogorov's entropy. The main point is that we suggest to average the shifts 
of the metric with respect to a given transformation and then compute the 
e- entropy of the average metric. The class of increasing sequences of posi- 
tive integers that normalize the growth of this e-entropy over all admissible 
metrics does not depend on the choice of an admissible metric, so that the 
asymptotics of the growth of these sequences is a new metric invariant of 
automorphisms. Admissible metrics are measurable metrics satisfying some 
conditions that do or do not depend on the automorphism (see Section 2). It 
is important that we consider not merely the e-entropy of a metric, but the e- 
entropy of a metric in a measure space. Admissible metrics play the same role 
as measurable generating partitions in the classical theory of Kolmogorov's 
entropy according to Sinai's definition. At first sight, the difference between 
using partitions and metrics looks rather technical: a partition determines a 
semimetric of a special form (a so-called "cut semimetric," see [18J). How- 
ever, our approach has two important differences from the classical theory. 
First, we use the e-entropy of the iterated metric on a measure space rather 
than the entropy of a partition; this is a generalization of Kolmogorov's en- 
tropy, which allows one to distinguish automorphisms with zero entropy. The 
second, more important, difference is that we use the average metric (rather 
than the supremum of metrics, which corresponds to the supremum of parti- 
tions), which has no interpretation in terms of partitions and which contains 
more information about the automorphism than the supremum of metrics. 
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In this paper, we give a necessary and sufficient condition for an auto- 
morphism to have a discrete spectrum in terms of the scaling sequence. The 
condition is that this sequence is bounded. This result generalizes a theorem 
of S. Ferenczi [T5| ll~6] . who considered the measure-theoretic complexity of 
symbolic systems by analogy with the ordinary complexity in symbolic dy- 
namics. Our approach substantially differs from that of [T5]: we consider 
an arbitrary admissible metric rather than the Hamming metric only, and, 
which is most important, introduce the average metric and show that it is 
admissible in many cases and, in particular, for the Hamming metric. 

Thus criterion, i.e., the boundedness of the growth of the scaling entropy, 
should first be applied to adic transformations, e.g., to the Pascal automor- 
phism (see [1]). Although it is not yet applied to checking that the spectrum 
of the Pascal automorphism is not discrete, 0, the corresponding combina- 
torics is already developed and described in the recent paper [19], where a 
lower bound is obtained on the scaling sequence of the sup metric for the 
same Pascal automorphism. Supposedly, one can extend this bound (Inn) 
to the average metric using the same techniques. 

A wider context is presented in our papers (U [S] , where we suggest a plan 
for the study of the dynamics of metrics in a measure space as a source of 
new invariants of automorphisms. It is important that the notion of scaling 
entropy provides an answer to the question of whether or not the spectrum 
of a given transformation is discrete. 

Sections 2-4 are of general nature and are intended not only for the pur- 
poses of this paper, which is devoted mostly to automorphisms with discrete 
spectrum. Here we introduce our main objects: admissible metrics, aver- 
ages, scaling sequences, and the scaling entropy of an automorphism. In 
Section 5, we study the dynamics of metrics on a group and find conditions 
under which the average metric is admissible. The main result is given in 
Section 6, where we present a criterion for checking whether the spectrum of 
an automorphism is completely or partially discrete. In the last section, we 
sketch possible applications, links to the ordinary construction of entropy, 
and general remarks about the dynamics of metrics. 

1 See the footnote in the subsection 7.2 
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2 Admissible metrics 



We consider various metrics and semimetrics on a measure space (X, 21, p) . 
In what follows, it is assumed to be a standard space with continuous measure 
p and cr-algebra 21 of modO classes of measurable sets, i.e., a Lebesgue space 
with continuous measure in the sense of Rokhlin (see [24J). The space X x X 
is endowed with the cx-algebra 21 x 21 and the measure |iX|i. 

We define a class of (semi)metrics on a measure space, which plays an 
important role in what follows. 

Definition 1. A measurable function p : XxX — > M + is called an admissible 
(semi)metric if 

1) p is a (semi)metric in the ordinary sense on a subset X' C X of full 
measure (pX' = I), i.e., p(x,y) > 0, p(x,y) = p{y,x), p(x,y) +p(y,z) > 
p(x,z) for all triples (x,y,z) G X' x X' x X' , (p x p){(x,x),x G X'} = 0, 
and 



In order to formulate the next condition, observe that if p satisfies Con- 
dition 1 ), then the partition ip p of the space X into the classes of points 
C x = {y : p(x,y) = 0}, x G X , is measurable. Hence we have a well-defined 
quotient space X p = X/ip p endowed with the quotient metric denoted by the 
same letter p and the quotient measure p^ p = p p . For the quotient space 
(X p , p, p p ) , Condition 1) is still satisfied. 

2) The (completion of the) metric space (X, p) (if p is a metric) or (X p ,p) 
(if p on X is a semimetric) is a Polish (= metric, separable, complete) space 
with a B orel probability measure p (respectively, p p ). 

Following the measure-theoretic tradition, we must identify (semi)metrics 
(and hence the corresponding spaces) if they coincide almost everywhere as 
measurable functions on the space (X x X, px p). Of course, a (semi) metric 
that coincides almost everywhere with an admissible (semi)metric is admis- 



Condition 2) means that the cr-algebra of Borel sets in the metric space 
(X, p) (or (X p ,p) in the case of a semimetric) is dense in the cr-algebra 21 of 

2 We can define the notion of almost metric as a measurable function for which all axioms 
on metric satisfy for almost all pairs or triples points. As F.Petrov noticed for each almost 
metric there exists the admissible metric in our sense which almost everywhere coincided 
with it. 
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all measurable sets and, therefore, the measure p (respectively, p p ) is a Borel 
probability measure. It is obvious from the definition that a semimetric p is 
admissible if and only if the corresponding metric in the quotient space X p 
is admissible. 

An equivalent definition of an admissible metric is as follows: almost 
every pair of points can be separated by balls of positive measure, or, in 
other words, the a-subalgebra generated by the open balls modO separates 
points of the space. 

One can also formulate the admissibility condition in terms of the notion 
of a pure function from our paper [6]: 

Lemma 1. A metric p is admissible if and only if it satisfies Condition 1) 
and, regarded as a function of two variables, is pure in the sense of JE/; 
the latter means that the partition of the space on the classes of equivalence 
x ~ y p{z : p{x,z) = p(y,z)} = 1 is the partition on the separate points 
modO. In other words: the map x \-¥ p(x, .) is infective from measure space 
(X, p) to the classes of modO equal functions of one variable. 

If p is a semimetric, then this condition must hold for the metric p on 
the quotient space (X p ,p,p p ). 

Indeed, the purity condition implies that the a-algebra of sets generated 
by the balls separates points and hence is dense in the cx-algebra (X, 21, //); 
this also implies the separability. The converse immediately follows from the 
definition of an admissible metric. 

It is well known (see [21]) that if (X, p) is a Polish space, then every 
nondegenerate Borel probability measure /ionX turns (X, p) into a Lebesgue 
space. In other words, the metric p on a Polish space (X, p) endowed with a 
Borel probability measure p is an admissible metric on the space (X,p). 

As in other our papers, in the definition of an admissible metric we re- 
verse the tradition and consider various metrics and semimetrics on a fixed 
measure space rather than various measures on a given metric space. Recall 
that triples (X,p,p), consisting of a metric space endowed with a measure, 
in M. Gromov's book [5] were called mm-spaces, and in the paper [7], metric 
triples or Gromov triples. 

It is useful to regard admissible metrics as densities of some finite mea- 

3 In other words, almost every point is uniquely determined by the collection of distances 
from this point to all points of some set of full measure (which may depend on the point). 
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sures equivalent to the measure p x p on X x X: 



dM, 



p(x,y)dp(x)dp{y). 



If we set J XxX p(x, y)dp(x)dp(y) = 1, which can be done by normalizing 
the metric, then the new measure is also a probability measure. Observe the 
following important property of admissible metrics implied by this interpre- 
tation. 

Theorem 1. For almost every point x 6 X , there is a uniquely defined 
modO conditional measure p x on X, which is given by the formula dp x (A) = 
f A p(x,y)dp(y) for every measurable set A C X. The family of measures 
{p x ; x e X} satisfies the condition 



The metric p, regarded as a metric on the measure space (X,p x ), is admis- 
sible. 

Proof. Consider the new measure M MiP on the space X x X and the measur- 
able partition into the classes of points C x = {(x, *)} C X x X, and use the 
classical theorem on the existence of conditional measures (see, e.g., [21]), 
which implies the desired formula and the uniqueness modO of the family of 
conditional measures. Now consider the space (X, p, p x ) for a fixed x; the 
metric p on this space is admissible since the admissibility of a metric is 
obviously preserved under replacing a measure with an equivalent one. □ 

The conditional measure p x (A) can be interpreted as the "average dis- 
tance," or the conditional expectation of the distance from the set A to the 
point x. 

In these terms, the condition that the metric, regarded as a function of 
two variables, is pure means that the conditional expectations do not coincide 
mod for almost all pairs of points. 

Now we can give a convenient criterion of admissibility. 

Proposition 1. A measurable function p(-, •) satisfying Condition 1) of Def- 
inition 1 is admissible if and only if the following non- degeneracy condition 
holds: 
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for [i- almost all x, the measure of arbitrary balls of positive radius is 
positive: We > 0, p{y : p(x,y) < e} > 0, which is equivalent to: 



for almost all x and all positive e. 

Proof. The fact that an admissible metric satisfies the condition in question 
was observed above. Now assume that this condition is satisfied. We must 
prove that the space (X, p) (if p is a metric) or the quotient space X p (if p is 
a semimetric) is separable. It suffices to consider the case of a metric. 

The condition stated in the proposition implies that for every e there 
exists 5 = 5(e) with lim € _ > o ( K e ) = such that some set of measure > 1 — 5 
contains a finite e-net. But this means exactly that the space is separable 
and the measure is concentrated on a cr-compact set. □ 

Several examples. 1. An important example is the following class of 
semimetrics, which was in fact intensively used in entropy theory. Every 
partition £ of a space (X, p) into finitely or countably many measurable sets 
gives rise to a semimetric: 



where is the element of £ that contains z. In this case, X p is a finite or 
countable metric space. Such (finite) semimetrics are called cuts, and their 
linear combinations are called cut semimetrics (in the terminology of [T8]). 
It is easy to see that cut (semi)metrics are admissible. 

2. The very important metric defined by the formula p(x,y) = const for 
x 7^ y determines a discrete uncountable space. We will call it the constant 
metric. The constant metric on a space with continuous measure is not ad- 
missible. In this case, the cx-algebra generated by the open sets is trivial and 
does not separate points of the space, i.e., a point is not determined by the 
collection of distances to the other points. 

3. The condition defining an admissible metric can be strengthened by 
requiring, in condition 1), that 



P x ([0,e})>0 



Pt{x,y) = %(x),£(y)), 
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for p > 1; in this case, we say that the metric is p-admissible. Let us say 
that oo-admissible metrics (semimetrics) are bounded; this class of admissible 
metrics will be most useful in what follows. 

4. In combinatorial examples, it often suffices to consider metrics with 
which the space is compact or precompact (or, in the case of a semimetric, 
quasi-compact). For example, adic transformations [3] act in the space of 
infinite paths of an N-graded graph, which is a totally disconnected compact 
space. 

The set of admissible metrics on a Lebesgue space (X, p) with continuous 
measure is a convex cone 7Z(X, p) = 7Z with respect to the operation of tak- 
ing a linear combination of metrics with nonnegative coefficients. This cone 
is a canonical object (by the uniqueness of a Lebesgue space up to isomor- 
phism) and plays a role similar to the role of the simplex of Borel probability 
measures in topological dynamics. In many cases, it suffices to consider ad- 
missible (semi) metrics that produce compact spaces (after completion), but 
we do not exclude the case of a noncompact space. One may consider differ- 
ent topologies on the cone 7Z; the most natural of them is the weak topology, 
in which an e-neighborhood of a metric p is the collection of metrics 

jfl : (pxp){(x,y) : \p{x, y) - 6(x, y)\ < e} > 1 - e\. 

The property of being an admissible metric is invariant under measure- 
preserving transformations: if a metric p is admissible and a transformation 
T of the space (X, p) preserves the measure p, then the image p T of p, 
defined by the formula p T {-, •) = p(T-,T-), is also admissible. Thus the 
group of (classes of) measure-preserving transformations acts on the cone 7Z 
in a natural way. 

3 Average and maximal metrics 

Let T be a measure-preserving transformation (in what follows, it will be 
an automorphism). When considering automorphisms or groups of auto- 
morphisms in spaces with admissible semimetrics, it is natural to assume 
that there exist an invariant set of full measure on which the (semi) metric is 
admissible in the sense of our definition. 

In addition to the notion of admissible (semi) metrics, we define the class 
of T -admissible metrics. The T-admissibility condition must be invariant, in 
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the sense that if M. is the class of T-admissible metrics, then 

VM = {p : p(x, y) = pi{Vx, Vy), Pl e M) 

is the class of V r TV A_1 -admissible metrics in the same sense. Among many 
possible versions, we choose the class of admissible (semi) metrics for which 
T is a Lipschitz transformation almost everywhere: there exists a positive 
constant C such that for (p x /^-almost all pairs (x, y), the condition 

p(Tx,Ty) < Cp(x,y) 

holds; let us say that metrics from this class are Lipschitz T-admissible 
(semi)metrics. 

The choice of an appropriate class depends on the problem under con- 
sideration and the properties of the automorphism. For example, in the 
case of adic automorphisms, it is most convenient to consider the class of 
Lipschitz metrics. This class can also be defined for countable groups of 
automorphisms. 

For an arbitrary admissible semimetric p, we have defined the partition 
ipp of the space (X, p). In the same way, given an arbitrary automorphism T 
of the space (X,p), we consider the T-invariant partition ipj = \f^ =0 T k ip p . 
We say that the semimetric p is generating for T if ipj is the partition into 
separate points modO (which we denote by e). If p is the metric generated 
by a finite partition, then this is the ordinary definition of a generator (see 

TO- . 

Let us define the average metric and the sup-metric for a given automor- 
phism. 

Definition 2. Let T be an automorphism of a space (X,p), and let p be an 
admissible metric. 

The average metric p^ is defined by the formula 

1 n— 1 

P T n (x, y ) = -J2p( Tk ^ Tk y)- 

n — ' 

fc=0 

The sup-metric is defined by the formula 

Pl(x,y)= sup p(T k x,T k y). 

0<k<n 
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The following important result is a direct corollary of the pointwise er- 
godic theorem. 

Theorem 2. For any automorphism T and any admissible (semi)metric p, 
the limit of the sequence of average (semi)metrics which we denote by p, 
exists almost everywhere in the space (X x X, p, x p) : 

1 n— 1 

p T (x,y) = lim -J2p(T k x,T k y); 

n->oo n ^— ' 

p is a metric if and only if p is a metric or a generating semimetric. 

The existence of the limit follows from the fact that the integral f x f x p(x, y)dp(x)dp(y) 
is finite and the ergodic theorem. 

Definition 3. The metric 

n—l 

p T (x,y) = lim -Y^p{T k x,T k y) 

k=0 

(the limit exists p-a.e.) is called the average, or the I 1 -average of p with 
respect to the automorphism T . 
The metric 

p T (x,y) = sup p(T k x,T k x) 

k>0 

is called the limiting sup-metric of p with respect to T. 

It is clear that p T and p T satisfy all conditions of the definition of a 
(semi)metric; in what follows, we will consider only admissible metrics or 
generating semimetrics p, so that p T , and p T for an ergodic automorphism 
T, are metrics. The superscript T in the notation for the average and sup- 
metrics will be omitted if the automorphism is clear from the context. It is 
also obvious that p < p. However, the metrics p, p may not be admissible 
even if the (semi) metric p is admissible. 

In what follows, we will mainly consider the average metric p. In most 
interesting cases, namely if T is weakly mixing, i.e., its spectrum in the 
orthogonal complement to the space of constants is continuous, this metric 
is constant and hence not admissible; the sup-metric may be constant even 
for automorphisms with discrete spectrum. However, in all cases we will 
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be interested not in the limiting metrics themselves, but in the asymptotic 
behavior of the averages p^ as n — >■ oo. As we will see, automorphisms with 
discrete spectrum stand apart, since for them the average metric is often 
admissible. 

It is clear that p is nothing else but the projection of the function p, 
regarded as an element of the space L 1 (X x X,p x p), to the subspace of 
(T x T)-invariant functions, i.e., the expectation of the metric p with respect 
to the subspace of invariant functions on X x X. This space consists of 
constants if and only if T has no nontrivial eigenfunctions (in other words, 
T is weakly mixing). In this case, p is almost everywhere a constant, which 
is equal to the average p-distance between the points of the space X. At 
the same time, if T is not weakly mixing, then the spectrum of T contains a 
discrete component and p may be a nonconstant T -invariant (semi)metric. 
In this case, one may obtain bounds on p using Fourier analysis. 

Definitions and Lemma 1 imply the following lemma. 

Lemma 2. Let p be an admissible Lipschitz metric for an automorphism T; 
then the average metric p is also admissible and Lipschitz for T . 

We do not prove this assertion, because we do not need it in this paper. 
Here is an example of computing the average metric generated by the cut 
semimetric in the case of a rotation of the circle. 

Example. Let X = T 1 = R/Z, and let A G X be an irrational number. 
Consider the semimetric p(x,y) = \xa( x ) — Xa(u)\, where xa is the indicator 
of a measurable set A C X; the metric p(x, y) is T-admissible in the sense of 
our definition for the shift T\ by any irrational number A. The corresponding 
average metric is shift-invariant and looks as follows: p{x,y) = m[(A + 
x)Ap(A + y)} = m[AA(A + x — y)}; it is obviously admissible. 

4 Entropy and scaling entropy 

4.1 The e-entropy of a measure in a metric space 

Recall that the e-entropy of a compact metric space (X, p) is the function 
e i — y H p {e) whose value is equal to the minimum number of points in an e-net 
of(A». 
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Definition 4. The e-entropy of a measure space (X, p) with an admissible 
metric p is the function 

e ^ H Pi£ (p) = inf{H(u) : k p (v,p) < e}, 

where v ranges over the set of all discrete measures with finite entropy and 
k p (-,-) is the Kantorovich metric on the space of Borel probability measures 
on (X, p) . The entropy of a discrete measure v = £\ Ci5 Xi is defined in the 
usual way: H{y) = — ^qlncj. 

For our purposes, it is more convenient to use in the above definition 
another characteristic instead of H(p), namely, 

k 

H' p ^n) = min{lnfc : 3X',p(X') > 1 - e, 3{x i } k 1 : X' C |J^«>}> 

i=i 

where V £ (x) is the e-ball centered at x; thus {x±, x 2 , ■ ■ ■ , x^} is an e-net in X' . 
The finiteness of H' follows from the fact that a Borel probability measure 
in a Polish space is concentrated, up to e, on a compact set; H' is more 
convenient for computations than H . 

In this paper, we use the following simple inequality. 

Lemma 3. For every compact metric space (X, p) and every nondegenerate 
Borel measure p, the following inequality holds: 

H Pt{d+ i )£ (p) < H' p£ (p), 

where d is the diameter of the space. 

Proof. Assume that the diameter of the compact space does not exceed 1. As- 
sume that the measure of a set X' is greater than 1 — e and X' C Ui=i Ve{ x i)- 
Thus the points x±, . . . ,Xk form an e-net in X'. Consider the discrete mea- 
sure v supported by the points x±, . . . ,Xk with charges v(xi) equal to the 
measures p(V(xi)) of the corresponding balls (if two balls have a nonempty 
intersection, then we distribute the measure of the intersection proportion- 
ally between their centers). Choose an arbitrary point Xoo and set its charge 
equal to 1 — p(X'). Now consider the Monge-Kantorovich transportation 
problem with input measure p and output measure v. It is easy to see that 
we have in fact determined an admissible plan \& for this problem: the trans- 
portation from a point x G X' goes to the points for which x G V £ (xi), 
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and the remaining part of the measure p on the set X\X' goes to the point 
Xoq. It is easy to compute the cost of this plan; this gives a bound on the 
Kantorovich distance between the measures v and pi: 

k p (v, pi) < e(l — e) + e < 2e. 

On the other hand, we have H{y) < In A; = H' pe (pi). □ 



4.2 Scaling sequence and scaling entropy 

Let us define the notion of a scaling sequence for the entropy of an auto- 
morphism. If an automorphism T is fixed, we omit the superscript in the 
notation for the average entropy and white simply p n . 

Definition 5. Let T be an automorphism of a Lebesgue space (X,pi) with a 
T -invariant measure p. By definition, the class of scaling sequences for the 
automorphism T and a given (semi)metric p on X is the class, denoted by 
Hp^iT), of increasing sequences of positive numbers {c n ,n G N} such that 

U pe {T) = \{c n } : < liminf Hf>nM < limsup Hf>nM < oo\. 

In many cases, the class of scaling sequences for a given metric p does not 
depend on sufficiently small e. In this case, it is obvious that all sequences 
{c n } from "Hp(T) are equivalent. 

Definition 6. Assume that for a given ergodic automorphism T of a space 
(X,p) there exists a (semi)metric p such that the class of scaling sequences 
for p is the maximal one (i.e., for any other (semi)metric, sequences {c n } 
from the corresponding class grow not faster than for p). In symbols, we 
write this fact as 

n P0 (T) = suvn p (T). 
p 

Then we say that "H po (T) is the class of scaling sequences for the automor- 
phism T and the metric po is T -maximal. 

It seems that such a metric exists for every automorphism. If we have 
chosen some T-maximal scaling sequence and the corresponding limit of en- 
tropies does exist, then it is called the scaling entropy. 
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Conjecture 1. For every ergodic automorphism T, a generic T -admissible 
Lipschitz metric is T -maximal. In particular, for a K- automorphism (i.e., 
an automorphism with completely positive entropy), the scaling sequence is 
equivalent to the sequence c n = h{T)n, where h(T) is the entropy of T , for 
every Lipschitz metric. 

A preparatory result in this direction was obtained in [T7] . 

In this paper, we will prove that for an automorphism with purely discrete 
spectrum and a T-admissible metric, the class % p (T) of scaling sequences is 
the class of bounded sequences. 

5 Invariant metrics on groups and averages 
of admissible metrics 

5.1 Invariant metrics and discrete spectrum 

Let us recall some known facts about ergodic automorphisms with discrete 
spectrum. It obviously follows from the character theory of commutative 
groups that the spectrum of a translation on a compact Abelian group is 
discrete. By the classical von Neumann theorem, the converse is also true: 
an ergodic automorphism with discrete spectrum is metrically isomorphic to 
the translation T on a compact Abelian group G endowed with the Haar 
measure m by an element whose powers form a dense subgroup: 

x i — y Tx = x + g, C\{ng, n e Z} = G 

(we use the additive notation). Note that on a compact group G there are 
many metrics that are invariant under the whole group of translations and 
determine the standard group topology. We will need the following assertion 
(which is, possibly, partially known). 

Proposition 2. The spectrum of an ergodic automorphism T of a measure 
space (X, p) for which there exists a T -invariant admissible semimetric p 
contains a discrete component. Moreover, if p is a metric, then the spectrum 
of T is discrete and, consequently, T is isomorphic to a translation on a 
compact Abelian group. 

Proof. Since an admissible (semi)metric lies in the space L 1 (A x X, p x p), 
the tensor square of the operator Ut, which corresponds to the automorphism 
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T x T, has nonconstant eigenfunctions. This can happen only if the spectrum 
of the unitary operator Ut contains a discrete component, and the first claim 
is proved. In other words, T is an extension of some quotient automorphism 
with discrete spectrum, which may coincide with T itself. This means that 
T is a skew product over a base with discrete spectrum. Denote by H C 
L 2 (X,p) the subspace spanned by all eigenfunctions of Ut- All invariant 
functions of the operator Ut <8> Ut belong to the tensor square H <g> H; hence 
these functions, regarded as functions of two variables, do not change when 
the argument ranges over a fiber of the skew product. But if these fibers are 
not single point sets, it follows that the metric does not distinguish points 
in fibers and hence is a semimetric. Thus if p is a metric, then each fiber 
necessarily consists of a single point, the spectrum of T is purely discrete, 
and, by the ergodicity, T is isomorphic to a translation on a group. □ 

Let us supplement this proof with an important refinement. Assume that 
the automorphism T has an orbit that is everywhere dense with respect to 
the metric p, i.e., T is topologically transitive (though we do not assume 
that it is a priori continuous). It is clear from above that this condition 
follows in fact from the existence of an invariant metric. Since the metric is 
admissible, we may assume without loss of generality that (X, p) is a Polish 
space. Consider the dense orbit O = {T n x, n G Z} of some point x. The 
restriction of p to O is a translation-invariant metric on the group Z, and all 
translations are isometries. Hence the completion of O is an Abelian group 
to which we can extend the translations and their limits. Therefore X is 
a Polish monothetic group o Obviously, the measure p is invariant under 
the action of the closures of powers of T, i.e., it is an invariant probability 
measure on the whole group. Hence, by Weil's theorem, X is a compact 
commutative group and T is the translation by an element whose powers are 
everywhere dense. 

5.2 Admissible invariant metrics 

As we have already observed, for weakly mixing automorphisms, the aver- 
age of every metric is constant, since there are no other invariant metrics. 

4 A topological group that contains a dense infinite cyclic subgroup is called monothetic. 
Note that there are many non-locally compact monothetic groups on which there is an 
invariant metric, but there is no invariant measure. A recent example is the Urysohn 
universal space regarded as a commutative group, see [9]. 
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However, for automorphims whose spectra contain a discrete component or 
are purely discrete, there are many invariant (semi)metrics. Hence, in order 
to study such automorphisms, we should investigate the question when the 
average metric for an automorphism T with a discrete spectrum is admissible. 

Given a translation-invariant metric p on a compact commutative group 
with the Haar measure, consider the function <f> p (r) = p(x, x+r) = p(0, r). In 
the example from Section 3, it looked as 0(r) = p(x,x + r) = m[AA(A + r)]. 

One can easily write down necessary and sufficient algebraic conditions 
on a measurable function that guarantee that it can be written as P for a 
measurable invariant metrics: 

0(0) = 0, <t>{x) > 0, <j>{-x) = 0(x), 4>{x) + <j>{y) > <t>{x + y). 

We will not need these conditions; the only important fact is that the 
admissibility condition from Lemma 1 can easily be reformulated in terms of 
this function. 

Theorem 3. An invariant measurable (semi) metric on a commutative com- 
pact group (satisfying Condition 1 from Definition 1) is admissible if and 
only if any of the following conditions holds. 

1. The corresponding function P is measurable, and 

p{z : <p p (z) ^ <p p {g + z)} > 

for almost all g. 
2. 

essmf 9eno <MfiO = 0, 

where V is an arbitrary neighborhood of the zero of the group in the standard 
topology. 

Proof. Condition 1 is exactly equivalent to the condition of Lemma 1. It is 
useful to give a direct proof that this condition is necessary. Assume that it is 
not satisfied, i.e., for all elements g from some set of positive Haar measure, 
4> p (z) = (f) p (g + z) for almost all z. Therefore, the measurable function 
is constant on cosets of the subgroup generated by g. However, every set 
of positive Haar measure in a nondiscrete Abelian compact group on which 
there is an ergodic translation contains an element g that generates a dense 
cyclic subgroup. Then, since the function p is measurable, it follows that it 
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is constant almost everywhere. But if the function p is constant, then the 
metric p is also constant and hence not admissible. 

Let us prove that Condition 2 of the theorem is necessary. Consider the 
function <p p for an invariant admissible metric p. Assume that 9 = ess inf from 
Condition 2 is positive; then, using the invariance of p, we can construct a 
continuum of points lying at a fixed positive p-distance greater than 9, which 
contradicts the admissibility. For the same reason, the set of values of <f) p 
is dense in some neighborhood of the zero. The fact that Condition 2 is 
sufficient follows from Proposition 1. □ 

The last assertion implies the following corollary. 

Corollary 1. For every admissible invariant metric there exists a sequence 
of group elements that converges to zero both in the standard topology and 
with respect to the (semi)metric. 

5.3 Admissibility of the average metric 

Now we can explicitly write down the condition that guarantees the admis- 
sibility of the average, i.e., invariant, metric in terms of the original metric. 
Consider an arbitrary measurable (non necessarily admissible) metric p on a 
compact commutative group G and an ergodic translation T by an element 
g. Let us write down an expression for the average metric: 

1 n f 
P T (x,y) = lim - V p(x + kg, y + kg) = / p(g + x, g + y)dm(g) 



(we have used the fact that the measure m is invariant). Hence the function 
0(r), regarded as a function of r, is measurable and has the form 



Obviously, p is a measurable function on the group G x G. Now we can 
check the admissibility condition for the average metric p. 





G 



p(z, z + r)dm(z) = p(x, y); y — x = r. 
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Definition 7. We say that an admissible metric p on a compact commutative 
group G with Haar measure m is semicontinuous at zero in mean if 



we say that it is semicontinuous at zero in measure if the following condition 
holds (in which "meas" means convergence in measure): 



Note that the second condition follows from the first one, and that both 
conditions are stated in purely group terms, i.e., do not depend on the par- 
ticular translation T. 

Thus we have the following admissibility criterion for the average metric. 

Proposition 3. The average metric p T for an ergodic translation T on 
a compact commutative group G is admissible if and only if the original 
(semi)metric p is admissible and semicontinuous at zero in mean. 

Proof. The "if" part is proved above; the "only if" part follows from the 
equality <f>p(r) = f G p(x,x + r)dm(x) and the previous proposition. □ 

Now we are ready to formulate and prove the following important fact. 

Theorem 4. For every bounded admissible metric on a compact commutative 
group G, the average metric is admissible. 

Proof. Assume that the metric p is not semicontinuous at zero and there 
exists a positive number c > such that 



Assume that the metric is normalized so that the diameter of the space 
X is equal to 1. Then it follows from our assumption that for sufficiently 
small (i.e., belonging to a small neighborhood of the zero in the group G) 
r there exists a (depending on r) subset in X of measure a, which does 
not depend on r and is greater than |, on which p(x,x + r) > |. Indeed, 
1 • a + (1 — a) | > f G p(x, x + r)dm(x) > c. But since the group is compact, 
there is a set of positive measure for all points x of which the inequality 




liminf (meas)p(x, x + r) = 0. 
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p(x, x + r) > | holds for all sufficiently small r from some set of positive 
measure. This in turn contradicts the admissibility of the metric p in the 
formulation of Proposition 1: arbitrarily small values </>(r) for small r cannot 
interlace with values greater than | by the triangle inequality. Thus we have 
proved that p is semicontinuous. □ 

Question. Does there exist an unbounded admissible metric p on the circle 
S 1 = K./Z for which the average metric p is not admissible? Does there exist 
an unbounded p-admissible metric, with 1 < p < oo, on a compact Abelian 
group for which the average metric is not admissible?^] 

6 Criterion for the discreteness and continu- 
ity of the spectrum in terms of the scaling 
entropy 

Now we formulate our main result. 

Theorem 5. For an ergodic automorphism with discrete spectrum realized 
as a translation on a compact commutative group with an arbitrary bounded 
admissible metric, or, more generally, with a metric for which the average 
metric is admissible, the scaling sequence is bounded. 

Proof. Since the set B £ = </> -1 ([0,£:]) is, by definition, the ball of radius e 
centered at G G in the metric p, which is admissible by assumption, it 
follows that mB £ > 0, since the metric is nondegenerate (see the definition 
of an admissible metric). But the sum B e + B e , like the sum A + A for 
every set A of positive Haar measure in a locally compact group, contains 
a neighborhood V of the zero in the standard topology (see, e.g., [28J). It 
follows from the triangle inequality that B £ + B £ C B 2e , so that V C -E^s- 
Since e > is arbitrary, we see that the topology on G determined by the 
average metric p coincides with the standard topology, i.e., in the topology 
determined by p, the group G is compact and contains a finite e-net for every 
e. 

s While the paper was in press, F. Petrov and P. Zatitskiy proved that the average 
of any (in particular, unbounded) admissible metric on the circle is admissible. Thus 
the additional assumptions that the average metric is admissible in Proposition 2 and 
Theorem 5 are superfluous. 
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The pointwise a.e. convergence 




lim - > p(x + kg,y + 
•woo n z — ' 

fc=0 




which follows from the pointwise ergodic theorem, implies that the number 
of points in an e-net for (G, p n ) tends to the number of points in an e-net for 
(G,p). This means that the sequence H PnjE (X) converges to H P ^{G). From 
the inequalities of Lemma 3 we see that the sequence H Pni£ (fj) is bounded as 
n — > oo, and thus the scaling sequence for the automorphism T, which acts 
on the metric triple (G,p,m), is bounded. □ 

Combining this theorem with the previous one, we obtain the following 
result. 

Theorem 6. An ergodic automorphism T has a discrete spectrum if and 
only if the scaling sequence for T is bounded for some, and hence for every, 
bounded admissible metric. 

Proof. Above we have proved that if an automorphism has a discrete spec- 
trum and the average metric is admissible, then the scaling sequence is 
bounded. But the average metric is always admissible provided that the 
original metric is bounded and admissible. 

Assume that the scaling sequence is bounded for an automorphism T and 
an admissible metric p. Recall that the average metric is indeed a metric (and 
not a semimetric). Consequently, the space (X,p) is precompact, and hence 
the metric p is admissible. Since it is T- invariant, it follows from Theorem 1 
that T has a purely discrete spectrum. □ 

Combining the last theorems with the previous results yields a criterion 
for the discreteness and continuity of the spectrum in terms of the automor- 
phism T and an arbitrary admissible metric. 

Theorem 7. Let T be an ergodic automorphism, and let p be a bounded ad- 
missible (semi)metric. If the corresponding scaling sequence is not bounded, 
then the spectrum of T contains a continuous component. If the scaling se- 
quence is not bounded for every admissible (semi)metric, then the spectrum 
ofT is purely continuous. 

In the next section, we will show how one could apply this criterion. 
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7 Comparison with the traditional approach, 
the Pascal automorphism, and concluding 
remarks 

7.1 Supremum metrics 

The entropy theory of dynamical systems, developed mainly by Kolmogorov, 
Sinai, and Rokhlin, essentially uses the tools of the theory of measurable par- 
titions. In Sinai's definition, the entropy appears as an asymptotic invariant 
of the dynamics of finite partitions under the automorphism: 

n n 

As a result of this theory, the study of the class of automorphisms with 
completely positive entropy was differentiated into a separate field, whose 
methods do not apply to automorphisms with zero entropy. For example, 
one cannot obtain a new invariant for such automorphisms following the 
same scheme. This can be seen from the following simple fact. 

Proposition 4. For every transformation T and every increasing sequence 
of positive numbers {c n , n e N} satisfying the condition lim n ^ = 0, there 
exists a generating partition £ such that 

hm — v fc ~ 1 — = oo. 

n c n 

This means that the maximum growth of the entropy H{\J^ =l T fc £) either 
is linear (for automorphisms with positive entropy), or, in the case it is 
sublinear, it is arbitrarily close to linear for every automorphism. Thus we 
obtain no new information. 

The metric corresponding to the supremum (product) £ n = V&=i T k l; of 
partitions is the supremum of the shifted metrics: p^(x, y) = sup 0<fc<n p(T k x, T k 
Hence, following our plan, we can use the e-entropy of the metric p^(x,y) 
instead of the entropy of the partition £ n itself. Then, using the definitions 
from Section 4, for a given metric p we can introduce an analog of the function 
H PtE (T) with the metric p n replaced by p n : 

H p>e (T) = {{c n } : < \imM HpnM < limsup HpnM < oo). 

L n— >oo Cn n-s.no Cn J 
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In this way we define the class of sup-scaling sequences c n for a given 
metric p. This also allows us to extend the classical entropy theory following 
the above scheme. Though it is somewhat easier to deal with the sup-metric 
than with the average metric, the former is much less useful than the latter. 
The metric p more often happens to be constant for an automorphism with 
discrete spectrum, while, as we have seen, p is always admissible if the original 
metric is bounded. Let us illustrate the important difference between the 
operations of taking the average and supremum metrics by the following 
example. 

Example. Let T be an irrational rotation of the unit circle, and let p 
be the semi-metric corresponding to a generating two-block partition (i.e., a 
partition into two sets of positive measure); the semi- metric p is T-admissible, 
hence, as we have seen, p is an invariant admissible metric. At the same 
time, p is the constant metric. This means that the scaling sequence c n is 
bounded, but c n (the scaling sequence for the sup-metric) is not; namely, we 
have c n ~ Inn. Thus the difference manifests itself even in the case of a 
discrete spectrum. 

Does in make sense to use intermediate averages, e.g., the l p - averages 



r1 n_1 i ~ 

\im\-J2 P(T k x,T k y y v =p p (x,y) 



m 

k=0 



for p G (l,oo), instead of Z 1 ? Apparently, they do not lead to any new 
effects: these metrics behave in the same way as the ^-average metric. For 
instance, in the above example, the p-average of the metric determined by a 
two-block partition of the unit circle is p p (x,y) = {m[AA(A + x — y)]}p . For 
p = 00 (i.e., the sup-metric), the picture is completely different, as in other 
interpolation theories. Thus in entropy theory, the use of average metrics 
substantially supplements the classical considerations. 



7.2 Application of the discreteness criterion 

As noted above, the problem of determining whether or not the spectrum 
of an automorphism is discrete, is not at all simple. Theorems 5-7 provide 
convenient non-spectral criteria for checking that the spectrum is not purely 
discrete; for this, one should bound the entropy from below for one admis- 
sible metric satisfying the conditions of Section 2 by a sequence that grows 
arbitrarily slow with n. 
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One of the intriguing examples of automorphisms for which the discrete- 
ness of the spectrum has not been neither proved nor disproved since the 
1980s is the Pascal automorphism. It was introduced by the author in 1980 
(see [21 E]) as an example of an adic transformation, and is defined as a nat- 
ural transformation in the space of paths in the Pascal graph regarded as a 
Bratteli-Vershik diagram with lexicographic ordering of paths. One can give 
a short combinatorial description of this transformation by encoding these 
paths with sequences of zeros and ones and identifying the space of paths 
with the compact space X = {0; 1}°° = Z2. Then the Pascal automorphism 
is defined by the formula 

T({1W1**} = {O^l^O**}); 

here i > 0, j > 0, and the domain of T and T" 1 is the whole X except for 
the countable set of sequences having finitely many zeros or ones. The most 
natural metric on X is the 2-adic metric p({xk}, {yk}) — 2 _n , where n is 
the first digit with Xk ^ yk- This metric is admissible, and the Pascal auto- 
morphism satisfies the Lipschitz condition almost everywhere. The orbits of 
this automorphism coincide with the orbits of the action of the infinite sym- 
metric group. The Bernoulli measures are T- invariant. The spectrum of the 
Pascal transformation was studied in the papers [211 [221 [231 12S] , where some 
interesting properties were established (e.g., it was proved that T is loosely 
Bernoulli, the complexity of T was computed, etc.), but the question about 
the type of the spectrum remains open. U In [HE] it was conjectured that the 
study of the behavior of scaling sequences may turn to be useful. The corre- 
sponding plan was carried out in [T9] , but in that paper a logarithmic lower 
bound was obtained on the scaling sequence for the sup-metric, and not for 
the average metric; this is not sufficient for the conclusion that the spectrum 
is not discrete. Nevertheless, one may hope that the combinatorics developed 
in [19] will help to prove that the scaling sequence is unbounded also for the 
e-entropy of the average metric, which, by our theorem, would imply that 
the spectrum is not discrete. There are many adic transformations similar to 
the Pascal automorphism for which the same question is also of great inter- 
est. For example, if we replace the Pascal graph with its multidimensional 
analog or the Young graph, we will obtain automorphisms that supposedly 
have continuous spectra. As observed above, in order to prove that there are 

6 Note on the translation: The answer is known now — the spectrum of Pascal auto- 
morphism is continuous. 
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no nontrivial eigenfunctions, one should obtain a growing lower bound on 
the scaling sequence not for one, but for all (or for some representative set 
of) bounded admissible (semi) metrics. 

7.3 The dynamics of metrics 

Recall that the general approach that consists in studying the asymptotic 
behavior of metrics is not exhausted by considering the asymptotics of the 
^-entropy of the average or supremum metric, i.e., does not reduce to study- 
ing the growth of scaling sequences; this is only its simplest version. In fact, 
we consider the original measure space (X, 21, p) with an action of an auto- 
morphism T (or a group of automorphisms G), fix an appropriate metric p, 
and study the sequence of metric triples 

. n— 1 

(X,pZ,n), where p T n {x,y) = - V p(T k x, T k y). 

k=0 

The conjecture is that, for a fixed measure and a fixed automorphism 
(or group of automorphisms), the asymptotic properties of this sequence of 
metric triples do not depend (or weakly depend) on the choice of an individual 
admissible metric from a wide class. These properties include not only the 
scaling entropy, but also more complicated characteristics of the sequence, 
say the mutual properties of several consecutive metric triples. Since the 
classification of metric triples up to measure-preserving isometry is known 
(see [SI one may hope to apply it to this problem. In this field there 
are many traditional and nontraditional questions. For example, what is the 
distribution of the fluctuations of the sequence of average metrics, regarded as 
functions of two variables on (X x X, p xp), as they converge to the constant 
metric (for weakly mixing transformations, e.g., i^-automorphisms) ? What 
can be said about the asymptotic properties of neighboring pairs of metric 
triples (with indices n and n + 1)1 Etc. 

In conclusion, it is worth mentioning that the concept of scaling entropy 
appeared in connection with the classification of nitrations in [H] and was 
used in [27J. In terms of the present paper, the scaling entropy for nitrations, 
i.e., decreasing sequences of measurable partitions or a-algebras, is the scaling 
entropy for an action of a locally finite group such as J^Z/2 instead of an 
action of Z considered here. The definitions we have given for an action of 
Z essentially coincide with those given in [H] for locally finite groups. 
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